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15 BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention relates in general to computer networks and more 
particularly to a parallel data transfer method and system for managing the transfer of 
data in parallel through multiple sockets. 

20 

2. Related Art 

Computer networks are widespread and vitally important to many types of 
enterprises including businesses, universities and government. In general, a computer 
network is two or more computers (or associated devices) that are connected by 

25 communication facilities. One type of computer network is a client/server network, A 
client/server network includes a server, which is a computer or a process that provides 
shared resources to users of the network, and a client, which is a computer or a process 
that accesses the shared network resources provided by the server using the 
communication facilities. 

30 In general, a client in a client/server network obtains information from a server by 

sending a request to the server. When the server receives the request, a server 
application on the server fills the request by obtaining the requested information and 
sending the information through the network to the client. The Internet (via the World 
Wide Web (WWW)) is an example of a client/server network. The internet is a public 

35 wide-area network (WAN) environment that enables a client to request and receive data 
located on a remote server. 
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The server computer includes a network adapter card that physically transmits 
and receives packets of data between the server computer and a client computer over 
the network. The server computer also includes server applications that are software for 
manipulating data. For example, server applications may include database programs, 
5 spreadsheets and browsers. Each server application can access the network through 
the network adapter card. The network adapter card is only a single physical connection 
between the server computer and the network, but can support multiple virtual 
connections between the multiple server applications and the card. 

Each virtual connection between a server application and the network adapter 
1 0 may be divided into three elements. A virtual port is at the network adapter card side of 
the virtual connection. The network adapter card normally can support multiple virtual 
ports. An application process (also called a "thread") is on the other side of the virtual 
connection. When the application is a server application, the application process is 
J called a server process. The thread can be processed by a processor on the server 

63 15 computer. Each server application may be running several threads, and multiple threads 
?? may be processed by the processor simultaneously. 

Ly A socket is used to connect a thread to a virtual port. A socket is a software 

r *f object that allows a thread of a server application to communicate with a virtual port of 

^ the network adapter card. The procedure of creating a socket between a thread of a 

O 20 server application and a virtual port of the network adapter card is known as "binding" 
%l the server application to the socket. When a server application is "bound" to a socket, a 

g! connection is made between a thread of the server application and a virtual port of the 

network adapter card. When this connection is established, the server application is 
able to communicate with the network adapter card and receive and transmit data over 
25 the network. 

Current socket techniques create a socket and bind a server application to the 
socket each time a client makes a request to the server. The server application stays 
bound to the socket from the time the client request is received to the time the server 
application sends a response to the client. In other words, a single server application 
30 monopolizes the socket until the client request is fulfilled. The server receives a client 
request, determines to which server application the client request is directed, creates a 
socket, bind the server application to the socket, processes the data and send the data 
to the client to fulfill the client request. All during this time the server application stays 
connected to the socket until the client request is fulfilled. 
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One problem with these current socket techniques is that multiple server 
applications are contending for the single socket. While a single server application 
monopolizes a socket other server applications are unable to access the network 
adapter card. This becomes a major bottleneck that can cause a reduction in network 
5 performance. 

Some current socket techniques attempt to avoid this problem by allowing two 
sockets to be used. In this socket pair technique, one server application is bound to one 
socket and another server application is bound to the other socket such that a socket 
pair is formed. The socket pair technique uses a socket assignment procedure to 
1 0 determine which server application is assigned to which socket. 

One problem with the socket pair technique is that the socket assignment 
procedure is will assign a socket even if the socket is in use and unavailable. This will 
cause socket contention and force a server application to wait for the socket to become 
i available. For example, the socket assignment procedure in the socket pair technique 

65 1 5 will assign the first socket of the socket pair. If this first socket is being used already by 
rT a server application, then the procedure will keep trying to acquire the socket. After 

yj several attempts the procedure may assign the second socket. However, valuable time 

is wasted and efficiency decreased because current socket assignment procedures 
a always assign the first socket, even if that socket is not available. Thus, current socket 

O 20 pair techniques do little to alleviate socket contention and bottlenecks and to increase 
S| efficiency. 

IP Therefore, what is needed is a method and system for managing the transfer of 

H data using a socket that avoids monopolization of the socket by a single server 

application. What is also needed is a method and system for managing data the transfer 
25 of data through multiple sockets that assigns sockets such that server applications are 
assigned only to available sockets. What also is needed is a method and system for 
managing the transfer of data through multiple sockets that provides multiple socket 
binding and assignment within the framework of existing operating systems. 

30 SUMMARY OF THE INVENTION 

To overcome the limitations in the prior art as described above and other 
limitations that will become apparent upon reading and understanding the present 
specification, the present invention includes a parallel data transfer method and system 
for managing the transfer of data in parallel through multiple sockets. The present 
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invention allows the use of multiple sockets so that monopolization by a single server 
application is avoided. The present invention releases a socket as soon as data has 
been transferred through the socket. This release of the socket means that the socket 
can be bound to other server applications. Thus, when a client request is sent through a 
5 socket to a server application the present invention does not allow the server application 
to monopolize the socket until the client request has been filled. Instead, the socket is 
released after the client request is sent to the server application, thereby making the 
socket available to other server applications. 

In addition, the present invention includes a novel socket assignment technique 

1 0 that only assigns available sockets. Before assignment, the intended socket is checked 
to ensure that the socket is not in use. This novel socket assignment technique avoids 
bottlenecking and delays due to socket contention and increases efficiency. Moreover, 
the parallel data transfer method and system of the present invention is scalable by 
adding more sockets as demand increases. The present invention is also easily 

15 implemented into current operating systems. 

The parallel data transfer method of the present invention allows the efficient 
transfer of data between a computer and a computer network. A virtual connection (or 
socket) is established between an application process (or "thread") on the computer and 
a network adapter card in communication with the computer network. The socket is 

20 used to transfer data between a first application process and the network adapter card. 
When the data is finished transferring, the socket is made available (or released) to 
other application processes. 

The present invention also includes creating multiple sockets to allow parallel 
transfer of data between application processes on the computer and the computer 

25 network. Upon request by an application process, that process is assigned to an 

available socket. In order to determine that the socket is available, a check is made of 
the socket to ensure that the socket is not being used. Socket assignment is performed 
using a variety of socket assignment techniques. One such technique is a round robin 
technique, which assigns available sockets in a round robin manner. Another technique 

30 is a random technique, whereby available sockets are assigned randomly. Moreover, a 
user defined technique may be used whereby an available socket is assigned as defined 
by a user. 

The parallel data transfer system of the present invention includes a parallel 
sockets module that provides parallel data transfer through multiple sockets having 
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application processes bound to each socket. The parallel sockets module includes a 
module that demultiplexes received network data (such as a client request) and also 
multiplexes network data that is transmitted from the application through the socket to 
the network adapter card. In addition, the parallel sockets module includes a binding 
5 module capable of binding an application process to a socket and an assignment 
module that uses a novel socket assignment technique to assign available sockets. 

Other aspects and advantages of the present invention as well as a more 
complete understanding thereof will become apparent from the following detailed 
description, taken in conjunction with the accompanying drawings, illustrating by way of 
1 0 example the principles of the invention. Moreover, it is intended that the scope of the 
invention be limited by the claims and not by the preceding summary or the following 
detailed description. 



BRIEF DESCRIPTION OF THE DRAWINGS 

1 5 The present invention can be further understood by reference to the following 

description and attached drawings that illustrate the preferred embodiments. Other 
features and advantages will be apparent from the following detailed description of the 
invention, taken in conjunction with the accompanying drawings, which illustrate, by way 
of example, the principles of the present invention. 

20 Referring now to the drawings in which like reference numbers represent 

corresponding parts throughout: 

FIG. 1 illustrates a conventional hardware configuration for use with the present 
invention. 

FIG. 2 is a block diagram of an individual computer system of FIG. 1 
25 incorporating the present invention and is shown for illustrative purposes only, 
FIG. 3 is a general block diagram illustrating an overview of the present 
invention. 

FIG. 4 is a general block diagram of the parallel sockets module of the present 
invention shown FIGS. 2 and 3. 
30 FIG. 5 is a flow diagram illustrating the general operation of the present 

invention. 
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DETAILED DESCRIPTION OF THE INVENTION 

In the following description of the invention, reference is made to the 
accompanying drawings, which form a part thereof, and in which is shown by way of 
illustration a specific example whereby the invention may be practiced. It is to be 
5 understood that other embodiments may be utilized and structural changes may be 
made without departing from the scope of the present invention. 

I. Exemplary Operating Environment 

The following discussion is designed to provide a brief, general description of a 

1 0 suitable environment in which the present invention may be implemented. It should be 
noted that FIGS. 1 and 2 depict only one of several ways in which the present invention 
may be implemented. 

FIG. 1 illustrates a conventional hardware configuration for use with the present 
invention. In particular, an enterprise computer system 100 may include one or more 

1 5 networks, such as local area networks (LANs) 1 05 and 1 1 0. Each of the LANs 1 05, 1 1 0 
includes a plurality of individual computers 115, 120, 125, 130, 135, 140, 145 and 150. 
The computers within the LANs 105, 1 10 may be any suitable computer such as, for 
example, a personal computer made by International Business Machines (IBM) 
Corporation, located in Armonk, New York. Typically, each of the plurality of individual 

20 computers is coupled to storage devices 155, 156, 157, 158 and 159 (such as a disk 
drive or hard disk) that may be used to store data (such as modules of the present 
invention) and computer-executable instructions in accordance with the present 
invention. Each of the plurality of individual computers 115, 120, 125, 130, 135, 140, 
145, 150 also may be coupled to an output device 160 (such as a printer) for producing 

25 tangible output. The LANs 105, 1 10 may be coupled via a first communication link 165 
to a communication controller 170, and from the communication controller 170 through a 
second communication link 175 to a gateway server 180. The gateway server 180 is 
preferably a personal computer that serves to link the LAN 105 to the LAN 110. 

The computer system 100 may also include a plurality of mainframe computers, 

30 such as a mainframe computer 185, which may be in communication with one or more of 
the LANs 105, 1 10 by means of a third communication link 190. The mainframe 
computer 185 is typically coupled to a storage device 195 that is capable of serving as a 
remote storage for one or more of the LANs 105, 110. Similar to the LANs 105, 1 10 
discussed above, the storage device may be used to store data and computer- 
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executable instructions in accordance with the present invention. Those skilled in the art 
will appreciate that the mainframe computer 185, the LAN 105 and the LAN 110 may be 
physically located a great distance from each other. By way of example, a user may use 
a client system of the mainframe computer 185 to access information located on a 
5 server of the LAN 105. 

FIG. 2 is a block diagram of an individual computer system (such as a network 
server computer) of FIG. 1 incorporating the present invention and is shown for 
illustrative purposes only. A computer 200 includes any suitable central processing unit 
(CPU) 210, such as a standard microprocessor, and any number of other objects 

1 0 interconnected by a system bus 212. For purposes of illustration, the computer 200 
includes memory such as random-access memory (RAM) 214, read-only memory 
(ROM) 216, and storage devices (such as hard disk or disk drives 220) connected to the 
system bus 212 by an input/output (I/O) adapter 218. The computer 200 may be a 
network server that is capable of connecting and interacting with a plurality of client 

1 5 machines over a communication channel (such as a network 221). Moreover, the 

network server is able to receive network requests from the plurality of client machines 
and serve up requested data to the client machines. Accordingly, as shown in FIG. 2, at 
least one of the memory devices (such as the RAM 214, ROM 216, and hard disk or disk 
drives 220) contains a parallel sockets module 222 in accordance with the present 

20 invention that contains computer-executable instructions for carrying out the present 
invention. As explained in detail below, the parallel sockets module 222 enables the 
present invention to manage the transfer data of using multiple sockets, and also 
includes a novel socket assignment technique for assigning a server application to a 
socket. 

25 The computer 200 may further include a display adapter 226 for connecting the 

system bus 212 to a suitable display device 228. In addition, a user interface adapter 
236 is capable of connecting the system bus 212 to other user interface devices, such 
as a keyboard 240, a speaker 246, a mouse 250 and a touchpad (not shown). In a 
preferred embodiment, a graphical user interface (GUI) and an operating system (OS) 

30 reside within a computer-readable media and contain device drivers that allow one or 
more users to manipulate object icons and text on the display device 228. Any suitable 
computer-readable media may retain the GUI and OS, such as, for example, the RAM 
214, ROM 216, hard disk or disk drives 220 (such as magnetic diskette, magnetic tape, 
CD-ROM, optical disk or other suitable storage media). 
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II* General Overview and Components of the Invention 

The parallel sockets module 222 of the present invention provides parallel data 
transfer through multiple sockets. For example, the parallel data transfer may occur 
5 when a server receives client requests from a plurality of client machines and then 
processes each request using different server applications. The present invention 
allows the threads of these different server applications to be bound to a corresponding 
socket. Alternatively, multiple threads of a single server application may be bound to a 
corresponding socket. FIG. 3 is a general block diagram illustrating an overview of the 

1 0 present invention. In particular, a plurality of client machines (client 1 , client 2, client 3 to 
client N) are connected to a server 300 through a network 310. It should be noted that N 
may be any number and even though four clients are shown in FIG. 3, any number of 
clients may be used. 

The plurality of client machines access the server 300 through an access 

1 5 channel 320 that in communication with the network 310. In a preferred embodiment, 
the access channel 320 includes multiple virtual ports that are in communication with a 
network adapter card. In this manner, data may be transferred through a socket through 
the access channel to the network adapter card and out over the network 310. The 
plurality of client machines is able to request and receive data from the server 300 

20 through the access channel 320. The server 300 also includes the parallel sockets 
module 222 of the present invention that enables parallel data transfer through the 
multiple sockets. 

As shown in FIG. 3, multiple sockets (a first socket, a second socket, a third 
socket and a N- socket) are bound to the access channel 320. In addition, each of the 

25 multiple sockets is capable of having at least one corresponding thread that is bound to 
a particular socket. Each of these threads may be associated with a single server 
application or each thread may be associated with a different server application. It 
should be noted that although FIG. 3 illustrates one thread per socket, multiple threads 
per socket could be used. Moreover, there may be situations where sockets are 

30 available and there are no threads bound to a socket. In addition to binding the multiple 
sockets to the access channel 320, the parallel sockets module 222 also uses a novel 
socket assignment technique to assign a thread to an available socket. 

The present invention releases a socket as soon as data has been transferred 
through the socket. As soon as data is finished transferring through a socket the socket 
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is released. This allows the socket to be bound to other server applications. For 
example, when a client request is sent through a socket to a server application the 
present invention does not allow the server application to monopolize the socket until the 
client request has been filled. Instead, the socket is released after the client request is 
5 sent to the server application, thereby making the socket available to other server 
applications. 

FIG. 4 is a general block diagram of the parallel sockets module 222 of the 
present invention shown FIGS. 2 and 3. In particular, the parallel socket module 222 
includes a network data processor 400 that divides network data into separate data units 
1 0 (such as packets). Because the network data is transmitted to and received from the 
network 310 through a network adapter card, both the incoming and outgoing network 
data must be serialized. Thus, the network data processor 400 must demultiplex 
incoming network data (in order to facilitate parallel data transfer) and must multiplex 
J outgoing network data (in order to facilitate its transmission over the network 310). This 

CO 15 demultiplexing is performed by dividing the network data into separate network requests 
(such as client requests). For example, when the network data processor 400 receives 
a plurality of client requests in the form of packets the network data processor 400 
demultiplexes them into separate requests from different clients. Moreover, when 
network data is transmitted from the server 300 through the access channel 320, the 
20 data is multiplexed by the network data processor 400 before being sent out over the 
network 310. 

The parallel sockets module 222 also includes an assignment module 410 for 
assigning a server application thread to an available socket. As explained in detail 
below, the assignment module 410 uses novel assignment technique to assign threads 
25 only to available sockets. Once a thread has been assigned to an available socket, a 
binding module 420 binds the thread to the assigned socket. 

III. Operational Overview and Working Example 

In general, the operation of the present invention provides for the parallel transfer 
30 of data through multiple sockets. The parallelizing of data transfer is accomplished by 
using a multi-thread operating system architecture, allowing multiple sockets and binding 
at least one thread to each available socket as needed. The method of the present 
invention is scalable, provides efficient support for multi-thread operating systems, and 
can be implemented within the framework of existing operating systems. 
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FIG. 5 is a flow diagram illustrating the general operation of the present 
invention. First, network data is divided into separate data units (box 500), such as 
packets. If the network data is incoming from the network adapter card, this step 
demultiplexes the network data into data units. Next, upon request a thread is assigned 
5 to an available one of the multiple sockets (box 510) and the thread is then bound to the 
assigned socket (box 520). Finally, at least one of the data units is transferred using the 
socket and the thread assigned to the socket (box 530). 

The socket assignment technique of the present invention includes assigning a 
thread to an available socket. This assignment may be accomplished in a variety of 

10 techniques, including random, user-defined and, in a preferred embodiment, a round 
robin technique. The round robin technique assigns each thread to a first available 
socket. The random technique assigns a thread randomly to an available one of the 
plurality of sockets. The user-defined technique assigns a thread to an available one of 
the plurality of sockets as determined by a user. This may include, for example, every 

1 5 other socket, every third socket, or any other scheme determined by the user. 

In order to illustrate the above method of the present invention, a working 
example is presented. It should be noted that this example is only one of many 
implementations of the present invention that is possible, and is provided for illustrative 
purpose only. In order to provide a technique is scalable to handle high-volume traffic, 

20 incoming client requests must be demultiplexed into separate client requests. In this 
working example, a port and a local IP address (LI A) are used such that multiple socket 
may be bound to each. The kernel of the operating system must be able to support a 
socket implementation that allows efficient demultiplexing of the incoming request to the 
port and the LIA. The server then has multiple instantiations of sockets bound to the 

25 port and the LIA, with each socket servicing a different client that is accessing the socket 
pair. 

In order for the multi-thread, multi-socket technique of the present invention to be 
scalable, both the transmission and reception of data by the server must be parallized. 
In order to use multiple sockets bound to the port and the LIA, a reusable socket option 
30 was used. This option permits sockets to be reused. Transmitting data is then achieved 
by having thread serve one socket and transmitting data through the thread's 
corresponding socket (which is bound to the same LIA and port) in parallel. 

For the reception of data, a round robin technique of assigning sockets to 
processing threads was used. When a request was received at a LIA and port 
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combination, the request could potentially be serviced by any of the sockets bound to 
the LIA and port combination and the socket's corresponding thread. A round robin 
assignment technique is preferred because the client requests then are distributed 
evenly among the available sockets (and corresponding threads) thereby providing 
5 scalability by design. It is the server's responsibility to determine how to respond to each 
network request received through any of these sockets. Because all the sockets are 
serving the same LIA and port combination, theoretically it does not matter how the 
server handles the data request or transmission once the request reaches the server. 
Although a round robin assignment technique was used in this example, the 

1 0 present invention also includes other types of assignment techniques, such as random 
or user-defined. In this working example a round robin assignment technique was used 
because it required minimal modifications. All that was necessary was to use a round 
robin assignment technique instead of the current first fit assignment technique (or 
assigning the thread to the first socket). This means that virtually no change to the 

15 socket application programming interface (API) is needed. Moreover, minimal 

requirements are imposed on applications. Applications can incorporate scalability by 
using the reusable socket option available for most operating systems and extending the 
implementation of the application's functionality to encompass multiple threads. The 
application, however, is responsible to map the data received on a reused socket to the 

20 appropriate thread. 

The foregoing description of the preferred embodiment of the invention has been 
presented for the purposes of illustration and description. It is not intended to be 
exhaustive or to limit the invention to the precise form disclosed. Many modifications 
and variations are possible in light of the above teaching. It is intended that the scope of 

25 the invention be limited not by this detailed description of the invention, but rather by the 
claims appended hereto. 
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